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Abstract 

Future smart environments will be characterized by mul- 
tiple nodes that sense, collect, and disseminate informa- 
tion about environmental phenomena through a wireless 
network. In this paper, we define a set of applications that 
require a new form of distributed knowledge about the en- 
vironment, referred to as non-uniform information granu- 
larity. By non-uniform information granularity we mean 
that the required accuracy or precision of information is 
proportional to the distance between a source node (infor- 
mation producer) and current sink node (information con- 
sumer). That is, as the distance between the source node 
and sink node increases, loss in information precision is 
acceptable. Applications that can benefit from this type 
of knowledge range from battlefield scenarios to rescue 
operations. The main objectives of this paper are two- 
fold: first, we will precisely define non-uniform informa- 
tion granularity, and second, we will describe different 
protocols that achieve non-uniform information dissem- 
ination and analyze these protocols based on complexity, 
energy consumption, and accuracy of information. 

1 Introduction 

Smart environments are characterized by a large num- 
ber of distributed sensors that collect environmental in- 
formation and disseminate that information across wire- 
less links. Typically, sensor networks focus on collecting 
this information in a single place for analysis, taking into 
consideration optimizations from local aggregation (e.g., 

* An earlier version of this paper appeared in the International Con- 
ference on Networking Protocols (ICNP 2003) [151. Significant changes 
in this paper include the addition of an analysis of the protocol in the 
presence of mobility, as well as an expanded discussion section. 



LEACH 121) or processing data en route to a central lo- 
cation (e.g.. Magnetos 1121 '). While such central collec- 
tion is important for many applications, it does not match 
the requirements of all applications that can exploit sensor 
networks. 

For example, consider a military application with sen- 
sors distributed throughout an area collecting information 
about passing vehicles, air contaminants, land mines, and 
other environmental data. We assume the sensors can 
communicate with one another, and a soldier that moves 
throughout the region can contact any nearby sensor to 
find out both the state of that sensor, as well as any other 
information it has collected from the other networked sen- 
sors. For this soldier, clearly the events occurring in the 
immediate neighborhood are most important. For exam- 
ple, it is more critical to know about a land mine nearby 
than one several miles away. Nonetheless, it is still impor- 
tant that the soldier have a general overview of the area in 
order to plan and make appropriate decisions. Similarly, 
consider a rescue scenario where a team of fire fighters is 
working to rescue trapped victims. In this case, the fire 
fighters require precise information about their immediate 
surroundings in order to make decisions about using re- 
sources to make progress, as well as some global knowl- 
edge to plan a path to the victims as well as an escape 
path back to safety. In the above applications, sensors are 
static and mobile users connect to the nearby sensors to 
obtain the requked information. However, one can imag- 
ine a network where mobile users themselves are carrying 
sensors placed on them. In this paper, we focus on pro- 
tocols for static sensor networks (sensors are stationary), 
but then show that the protocols are resilient to mobility. 

The applications above differ from those typically stud- 
ied for sensor network applications in the following way: 
the information is not collected centrally, but instead it is 
utilized at several places in the network (e.g., the locations 
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of the individuals). While some sensor network applica- 
tions accomplish this in a query driven manner, asking a 
central source for the latest collected information, these 
applications require continuous updates. A simplistic so- 
lution to this problem is to proactively flood updates from 
each sensor to every other sensor. This solution is ex- 
tremely inefficient and does not scale to large numbers 
of sensors. In the scenarios we target, information from a 
particular sensor is most important to those surrounding it, 
with the value of the information decreasing as a function 
of distance from the sensor. Specifically, the necessary 
precision of information is proportional to the distance 
between an information producer and an information con- 
sumer We refer to such a requirement as a non-uniform 
information dissemination requirement, a new concept we 
introduce here. 

This paper introduces and analyzes several protocols 
that perform non-uniform information dissemination. As 
these protocols are intended to run on wireless sensor net- 
works, they must abide by the requirements of that envi- 
ronment, namely they must be energy efficient and have 
low complexity. The distinguishing feature of this new 
application class is that it is possible to trade accuracy of 
disseminated information for energy. Our experimental 
results clearly show this trade-off using a number of dif- 
ferent protocols. 

The remainder of this paper begins with a characteriza- 
tion of the requirements for non-uniform information dis- 
semination protocols. Section |3] describes the details of 
several protocols. In section|3 we discuss the implemen- 
tation details of the protocols within the ns-2 simulator 
and then present our experimental results, followed by a 
discussion in section |5] which presents more insight into 
our results. Section|5]describes related work and section^ 
presents conclusions and future work. 

2 Design Goals 

We propose using the following design goals for sensor 
network protocols for applications that have non-uniform 
information dissemination requirements. 

• Energy efficiency. As sensor nodes are battery- 
operated, protocols must be energy-efficient to max- 
imize system lifetime. 

• Accuracy. Accuracy is a measure of the sensing fi- 
deUty: obtaining accurate information is the primary 
objective of a sensor network. Accuracy is a metric 
that is application-specific both in terms of the appro- 
priate metric and the required fidelity level. There 
is a trade-off between accuracy, latency and energy 



efficiency. In the applications we target, it is accept- 
able for sensors to have information with low accu- 
racy about locations that are far away, but they should 
have highly accurate information about locations that 
are close by. Because of this non-uniform informa- 
tion dissemination requirement, a given sensor will 
not have all the information from all other sensors 
at every point in time. Consider a case where sen- 
sor Si receives every n*'^ packet from another sensor 
52. In this case. Si receives the i*'' packet from 52 
at time ti and the {i + n)*-^ packet from 52 at time 
t2, (n > 1). Thus, in the interval (^1,^2), the infor- 
mation Si has about 52 is not as accurate as a sensor 
that receives every update that 52 sends. We define 
accuracy in terms of the difference between the local 
value of the information and the actual value. 

• Scalability. Scalability for sensor networks is also a 
critical factor For large-scale networks, distributed 
protocols are needed such that the protocol is based 
on localized interactions and does not need global 
knowledge such as the current network topology. 

With these design goals in mind, in this paper we 
present simple deterministic protocols (Filtercast and 
RFiltercast) and non-deterministic protocols (unbiased 
and biased protocols) to achieve non-uniform informa- 
tion dissemination. Compared to flooding, these protocols 
reduce the cost of communication by reducing the num- 
ber of packet transmissions and receptions. At the same 
time, these protocols are designed to operate within the 
application-specific tolerance in terms of accuracy. Our 
results indicate that these protocols outperform flooding 
in terms of energy efficiency by trading-off accuracy for 
energy while keeping the accuracy acceptable by the ap- 
plication. The next section describes the details of each of 
these protocols. 

3 Dissemination Protocols 

This section introduces the mechanisms of several pro- 
tocols that perform non-uniform information dissemina- 
tion. Similar to traditional sensor networks, every sen- 
sor in the network serves as a source of information to 
be spread throughout the network. Unlike traditional net- 
works where a specific node serves as the sink node, every 
sensor in our system receives and stores some data from 
the other sensors in the system. 

We begin our protocol discussion with a traditional 
flooding algorithm. Flooding achieves uniform informa- 
tion dissemination, and serves as a baseline of compari- 
son for the rest of our protocols. Following this, we in- 
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troduce two new deterministic protocols and analyze two 
non-deterministic protocols lil3J . 

3.1 Traditional Flooding 

In flooding, a sensor broadcasts its data, and this is re- 
ceived by all of its neighbors. Each of these neighbor 
sensors rebroadcasts the data, and eventually each sen- 
sor in the network receives the data. Some memory of 
packets is retained at each sensor to ensure that the same 
packet is not rebroadcast more than once. If each sensor 
broadcasts its data, then with this flooding protocol, every 
sensor in the network will receive data from every other 
sensor. Thus, ignoring distribution latency, which is the 
amount of time required for a packet to travel from the 
source to the farthest sensor in the network, every sen- 
sor has an identical view of the network at every point in 
time (ignoring packet collisions and timing issues). Fur- 
thermore, the protocol itself is simple and straightforward 
to implement. Unfortunately the simplicity and high ac- 
curacy come at the price of high energy expenditure. The 
massive data replication requires active participation from 
every sensor in the network, and thus sensors can quickly 
run out of energy. 

3.2 Deterministic Protocols 

In analyzing the flooding algorithm, it is apparent that 
to achieve non-uniform information dissemination, one 
approach is to simply have intermediate nodes forward 
fewer packets. The two protocols we introduce here, Fil- 
tercast and RFiltercast, achieve just that by deterministic 
means. 

3.2.1 Filtercast 

As the name suggests, Filtercast filters information at each 
sensor and does not transmit all the information received 
from other sensors in the network. Filtercast is based on 
a simple idea of sampling information received from a 
given source at a certain rate n, specified as a parameter 
to the protocol. The lower the value of n, the more accu- 
rate the information disseminated by the protocol. When 
n = 1, Filtercast behaves identically to flooding. Dur- 
ing protocol operation, each sensor keeps a count of the 
total number of packets it has received so far from each 
source, sourcccnt- A sensor forwards a packet that it re- 
ceives from source only if (sourcecnt rriod n) ~= 0, 
then increments sourcecnt- We refer to the constant i as 
the filtering frequency. The intuition behind this protocol 
is that as the hop count between a source node and a sink 
node increases, the amount of information re-transmitted 



decreases due to the cascading effect of the filtering fre- 
quency at each subsequent sensor. 

While this reduces the total number of transmissions 
compared to flooding, the state information maintained 
at each sensor increases. Specifically, each sensor must 
maintain a list of all the sources it has encountered from 
the start of the application and a count of the number of 
packets seen from each of these sources. As this increases 
linearly with the size of the network, it may pose some 
scalability problems. 

3.2.2 RFiltercast 

One potential problem with Filtercast is the synchroniza- 
tion of the packets transmitted by the neighbors. For ex- 
ample, consider a scenario where sensors S2 and S3 are 
one-hop neighbors of both si and 54, while si and 54 are 
two hops away from each other. In this case, if Filter- 
cast is used as a dissemination protocol, then S2 and S3 
will end up forwarding either all odd or all even packets 
(synchronized on forwarding the packets) from si to 54, 
effectively transmitting redundant information. 

Our intuition is that if we can remove this redundancy, 
we may be able to increase the accuracy of the protocol 
without increasing the energy expended. 

To address this effect, we propose another proto- 
col: Randomized Filtercast (RFiltercast). In this vari- 
ant of Filtercast, the filtering frequency i is still the 
same for all sensors, but each sensor generates a ran- 
dom number r between 1 ... 71 and re-transmits a packet if 
{sourcccnt mod n) — r == 0. Intuitively, this means that 
each sensor considers a window of size n and will trans- 
mit only one of the packets from a given source in this 
window. So, for a window of size 2, half of the packets 
will be selected for re-transmission, but instead of always 
re-transmitting the first of the two packets (as in Filter- 
cast), the sensors that choose r = 1 will transmit the first 
of the two packets while the sensors that choose r — 2 
will transmit the second of the two packets. 

While our intuition was that the same energy would be 
expended by RFiltercast as for Filtercast, this turns out not 
to be true. In fact, RFiltercast transmits more packets than 
Filtercast, but fewer than Flooding, putting its energy ex- 
penditure in between the two. This effect happens because 
in RFiltercast, a node receives more packets from a given 
source, as described above, and thereby ends up transmit- 
ting more packets on behalf of the source. Note that for- 
warding decisions at an intermediate node are not based 
on packet IDs but are based on the number of unique pack- 
ets received from the source node. Therefore, due to the 
reception and transmission of more packets in RFiltercast, 
the energy dissipation of RFiltercast is higher than that of 
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Filtercast. 

While RFiltercast has more transmissions, increasing 
its energy expenditure, it also has improved accuracy over 
Filtercast. The crucial point to extract is that RFiltercast 
should, on average, propagate information faster than Fil- 
tercast, leading to more accurate data throughout the net- 
work, but RFiltercast will require less energy than flood- 
ing. 

3.3 Randomized Protocols 

Both RFiltercast and Filtercast are lightweight and easy to 
analyze due to their deterministic nature. However, they 
still have some overhead in terms of the state required at 
each node. 

We next describe several probabilistic protocols. In 
these protocols, when a sensor receives a packet, it 
chooses a random number and then decides whether to 
forward the packet or not based on the number chosen. We 
classify these protocols into two categories: biased and 
unbiased protocols. In the biased protocol, sensors bias 
their decision about whether to forward a packet based on 
the location of the source, where packets from close sen- 
sors are more likely to be forwarded than packets from 
distant sensors. In the unbiased protocol, all packets are 
forwarded with equal probability. 

3.3.1 Unbiased Protocol 

The notion of using probabilities to flood packets through- 
out a network has been studied previously fT/^ ll 1II17I . 
but to the best of our knowledge, no studies exist that ex- 
plore its applicability to non-uniform information granu- 
larity requirements. Similar to the deterministic protocols, 
the unbiased protocol also takes a parameter that affects 
the accuracy of the forwarding. In this case, the param- 
eter specifies the probability that a packet should be for- 
warded. In the case of the unbiased protocol, this value is 
the same for each incoming packet. 

The main advantage of this protocol is its simplicity and 
low overhead. As every packet is forwarded only with a 
certain probability, the protocol results in less communi- 
cation compared to flooding (proportional to the forward- 
ing probability). Also, the protocol does not require state 
to be kept, giving this protocol the potential to scale well. 

To adjust the accuracy of the information throughout 
the network, the forwarding probability can be tuned ac- 
cording to the application needs. The primary tradeoff, 
however, is energy for accuracy. In general, as the for- 
warding probability increases, the behavior converges to- 
ward flooding. While our current study considers only 
constant probabilities, in the future we will look at the 



possibility of probabilities being adjusted dynamically to 
adapt to the current network traffic and the application 
needs. 

3.3.2 Biased Protocol 

In this protocol, the forwarding probability is chosen to 
be inversely proportional to the distance the packet has 
traveled since leaving the source sensor In other words, 
if a sensor receives a packet from a close neighbor, it is 
more likely to forward this than a packet received from 
a neighbor much farther away. To estimate distance be- 
tween sensors, a sensor examines the TTL (time-to-live) 
field contained in the packet. If we assume all sensors use 
the same initial TTL, we can use the current TTL to adjust 
the forwarding probability for each packet. The follow- 
ing tuples indicate the forwarding probabilities used (sec- 
ond number in the tuple) when the packet has traveled the 
number of hops in the range specified in the first part of 
the tuple: < 1-3, 0.8 >, < 4-6, 0.6 >, < 7-9, 0.4 >, 
< 10+, 0.2 >. 

Using TTL to estimate distance is simple; however, 
note that TTL does not always indicate the exact distance 
between two sensors. For example, consider a source 
node S and a destination node D. It is possible that ei- 
ther due to congestion or collisions, a packet gets dropped 
along the shortest path and another packet reaches node 
D via a longer route. In that case, the TTL would give a 
false estimate of distance. However, in a static network, 
node D can always maintain its current estimate of the 
TTL to node S. In the case of mobile sensors, as the dis- 
tance between S and D changes (decreasing, for example, 
as the nodes get closer), this is reflected in the subsequent 
packets (higher TTL value) and thus node D gets more 
and more accurate information about S. We use the TTL- 
based approach for the biased protocol mainly for its sim- 
plicity, resilience to mobility and energy efficiency. 

Similar to the unbiased protocol, this biased protocol 
requires no additional storage overhead unless node dis- 
tances are stored, and the protocol itself is completely 
stateless (note, however, that this does not eliminate the 
caching of recently seen packets in order to avoid re- 
broadcasting the same packet multiple times). Therefore, 
this protocol scales as well. 

4 Experimental Study 

In order to analyze the protocols, we use the ns-2 discrete 
event simulator IIOI . Table[2lists the relevant parameters 
used during our simulations. 
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Table 1: Simulation parameters. 



Simulation area 


800 X 800 


Transmission range 


100 m 


Initial Energy 


10000 J 


MAC Protocol 


802.11 


Bandwidth 


1 Albps 


Transmit Power 


0.660 W 


Receive Power 


0.395 W 


Idle Power 


0.0 


Number of Nodes 


100 



In the case of static networks, we consider two sensor 
deployment strategies: uniform and random. In a uniform 
deployment strategy, sensors are distributed with some 
regular geometric topology (e.g., a grid). With random de- 
ployment, sensors are scattered throughout the field with 
uniform probability. For a battlefield-like scenario, ran- 
dom deployment might be the only option, but with appli- 
cations such as animal tracking in a forest, sensors may 
be deployed in a deliberate, uniform fashion. 

In order to simulate sensor readings, we divide the sim- 
ulation into an initialization phase and a reporting phase. 
During the initialization phase, each sensor chooses a ran- 
dom number between and 100 to serve as its initial sen- 
sor reading. During the reporting phase, each sensor in- 
crements its reading by a fixed amount at fixed intervals. 
In the real world, due to correlation among physically co- 
located sensors, sensors will have a different reading pat- 
tern; however, this simulation does provide us with valu- 
able information about the behavior of our protocols un- 
der various conditions. In the latter part of this section, 
we present a revised data model that tries to capture cor- 
relation among sensor readings. Our results indicate that 
the overall behavior of the protocols shows a very similar 
trend for both data models. 

4.1 Traffic Load Study 

This study focuses on evaluating the effect of a change 
in traffic load for both grid and random topologies. In the 
first set of experiments, we study the effect of varying traf- 
fic loads systematically from 5 packets/sec to 1 packet/ 2 
sec. The goal of these experiments is to understand the re- 
lationship between accuracy, reporting rate, and network 
capacity for both uniform and non-uniform dissemination 
scenarios. 

Note that in order to calculate accuracy, we find the dif- 
ference between a sensor's local view of another sensor's 



data and the actual value of that sensor's data. A view is 
essentially the latest data that one sensor has from another 
sensor. This view is then normalized based on distance. 
Let R{Si,j) denote sensor S^s view of sensor Sj's data, 
and let n be the total number of sensors in the network. 
The weighted error a for a sensor Si is given as: 

e^ = ^ - * (1) 
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where d{Si, Sj) is the EucUdean distance between sensors 
Si and Sj and 7 was set to 100 (the transmission range 
of each node). The fist equation shows that for a given 
sensor we calculate weighted average error with respect 
to all other sensors in the network. We vary the weight in 
terms of distance with a step size of 100 meters. 

Note that Euclidean distance is used as the weighing 
factor so that the higher the distance, the smaller the con- 
tribution of error toward overall error. This error calcu- 
lation describes our non-uniform data dissemination re- 
quirement by giving higher weight to errors for data that 
originated in a close neighborhood and lower weight to 
errors for data that originated from a distant sensor. It is 
worth noting that although we refer to this as error, be- 
cause the value of the data at the source increases linearly, 
it also represents the accuracy of the data. 

Our results indicate that with flooding, congestion is a 
severe problem, and other protocols are less prone to the 
congestion problem. In this type of application, the effect 
of congestion is worse than that observed in traditional 
sensor networks 1 14|. From the simulation studies, we can 
see that flooding is the least energy-efficient protocol and 
has the highest error if the network is congested. RFilter- 
cast and the biased protocol are more energy-efficient than 
flooding and provide low error in most cases. Filtercast 
and the unbiased protocol are the most energy-efficient 
protocols, but their accuracy is good (low error) only at 
higher sending frequencies. 

4.1.1 Grid Topology 

Figure ^ shows the performance of flooding, Filtercast, 
RFiltercast, and the biased and unbiased randomized pro- 
tocols under various traffic loads for the grid topology. In 
these graphs, distance is varied across the X-axis (in steps 
of 100 meters) and the Y-axis shows mean unweighted er- 
ror (mean absolute error). Note that, with non-uniform 
information dissemination, as the distance between the 
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source node and sink node increases, loss in information 
precision is acceptable. 

From Figure [T(a)l where the data rate is 5 packets/sec, 
we can see that even though theoretically flooding should 
have no error, due to congestion, flooding has the highest 
error. This is due to the fact that if the total traffic ex- 
ceeds the network capacity, congestion causes packets to 
be dropped and this gives rise to loss of information and 
high error At the same time, high traffic results in higher 
collisions. In this situation, even RFiltercast and the bi- 
ased randomized protocol result in high traffic load and 
thus they have high error as well. However, both Filter- 
cast and the unbiased randomized protocol (with forward- 
ing probability of 0.5) perform well in this case because 
the traffic load does not exceed the available network ca- 
pacity. As expected, for all protocols the error increases 
as the distance from the source increases, resulting in non- 
uniform information across the network. 

When the sending frequency is changed to 2 pack- 
ets/sec, as shown in Figure [T(b)l flooding the network still 
causes congestion and thus flooding has high error How- 
ever, now for both RFiltercast and the biased protocol, the 
load does not exceed the network capacity and their per- 
formance is better than in the previous case. Also, note 
that now these two protocols perform better in terms of er- 
ror rate than the unbiased protocol and Filtercast because 
of the fact that they disseminate more information yet the 
information disseminated does not exceed the network ca- 
pacity. 

When the sending frequency is lowered to 1 packet/sec, 
as shown in Figure [T(c)| then even flooding does not ex- 
ceed network capacity. Since the network is no longer a 
bottleneck, flooding disseminates the maximum informa- 
tion successfully and clearly has the lowest error Both the 
biased and RFiltercast protocols perform better than the 
unbiased protocol and Filtercast. The unbiased protocol 
and Filtercast have the highest error in this case because 
they do not disseminate as much information as the other 
protocols. The same trend continues even for the lowest 
sending frequency, shown in Figure [T(d)| 

The interesting point about these results is the oscil- 
latory behavior of the energy-error curves. To elaborate 
further on this, if the total data exceeds network capacity, 
then any further data on the channel will increase conges- 
tion and decrease overall lifetime of the network. When 
the amount of data transmitted is below network capacity, 
then there is a trade-off between energy spent and accu- 
racy observed. This is because as long as the total data 
does not exceed network capacity, sending more data will 
improve accuracy at the cost of energy spent in communi- 
cation. However, with non-uniform information granular- 
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Figure 2: Grid: Weighted energy-accuracy tradeoff. 

ity, accuracy between two sensors is proportional to dis- 
tance between them. Therefore, RFiltercast and Filtercast 
try to achieve this by filtering packets and the randomized 
protocols try to achieve this by probabilistically forward- 
ing packets. 

Figure |2] shows the trade-off between energy and 
weighted error, using the weighted error calculation 
method described in Eqns. [QandlJl In Figure |2l the X- 
axis indicates the energy spent in Joules and the Y-axis 
shows mean weighted error. Each point represents one of 
the sending frequencies, ranging from 5 packets per sec- 
ond for the left-most point of each curve to one packet per 
two seconds for the right-most. 

For flooding, when the reporting frequency is highest, 
the energy spent is maximum. However, as mentioned 
earlier, congestion and collisions cause high error As the 
sending frequency decreases to the point that total traf- 
fic does not exceed network capacity, the error also de- 
creases. Flooding performs the best in terms of accu- 
racy (minimum error) when the sending frequency is 1 
packet/sec; after this rate, the error starts increasing due to 
the fact that not enough information is propagated. This 
is an interesting phenomenon, where the error oscillates 
between these two bounds. The upper bound is a func- 
tion of the network capacity, whereas the lower bound is 
a function of the application-specific accuracy. Previous 
research has also shown this phenomenon 1 14|. 

Based on the energy-error trade-off, we can say that at 
high sending frequency, flooding performs the worst by 
spending high energy while not providing accurate infor- 
mation (high error). RFiltercast and the biased protocol 
start performing better than flooding at high rates. There 
is a considerable difference between energy and error for 
RFiltercast and the biased protocol compared to flooding 
at the sending frequency of 2 packets/sec. As one can 
anticipate, flooding performs better than all other proto- 
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Figure 1: Grid Topology: Mean absolute error as a function of distance for different source data rates. 
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cols in terms of accuracy when the sending frequency is 1 
packet/sec, but note that there is not much difference be- 
tween flooding, RFihercast and the biased protocol even 
when the network is operating in the non-congested mode. 
Filtercast and the unbiased protocol perform best in terms 
of energy and error at high sending frequencies and their 
performance relative to the other protocols starts to de- 
grade as the sending frequency is reduced. 

The desirable mode of operation for a protocol is in 
the region where minimum energy is spent and low er- 
ror is observed. Note that the desired mode of operation 
for a protocol depends on factors such as network density, 
transmission range of the radios, etc. In our future work, 
we will perform an analytical study to address this issue. 
From Figure|2j this zone lies around sending frequency 2 
packets/sec to 1 packet/sec for RFiltercast and the biased 
protocol, whereas for flooding it lies at sending frequency 
1 packet/sec. We want to point out that as the network size 
increases, flooding can pose severe problems in terms of 
scalability and energy efficiency. Therefore, randomized 
protocols should be considered as viable alternatives in 
these cases. In our experiments we had a network of 100 
sensors, but with a network of thousands of sensors we 
believe that randomized protocols will perform much bet- 
ter than flooding. With randomized protocols, the biased 
protocol performs the best by spending moderate energy 
and getting high accuracy. 

Our results show that randomized protocols can achieve 
high energy savings while at the same time achieving ac- 
ceptable accuracy with almost no overhead. Also note that 
RFiltercast and the biased protocol have almost equivalent 
error curves while the biased protocol has negligible over- 
head. 

4.1.2 Random Topology 

Figure |3] shows our results with a random topology and 
the same traffic loads as before. The results for 1 and 2 
packets/sec and the energy tradeoff study show expected 
results similar to those achieved for the grid topology, and 
are therefore not shown here. It is not clear whether regu- 
lar deployment will offer advantages over uniformly dis- 
tributed random deployment; if it does not, random de- 
ployment is preferable because of its low cost. 

4.1.3 Transmission Range 

Our next set of experiments show the effect of an increase 
in the transmission range from the original 100 meters to 
150 meters, while keeping the original 10x10 grid topol- 
ogy. An increase in transmission range corresponds to an 



increase in the degree (connectivity) of a sensor. This re- 
sults in decreasing the capacity of the network, meaning 
congestion occurs even at low sending frequencies. In- 
tuitively, this will make the overall situation worse if the 
network is operating in a congested mode. This can be 
seen from our results, comparing Figure s [l ( a)| and |4(a)| as 
there is an increase in overall error for flooding, RFilter- 
cast and the randomized protocols. 

In this case, even at the low sending frequency of 1 
packet/sec shown in Figure |4(c)| flooding does not per- 
form well due to network congestion. Previously, when 
the transmission range was 100 meters, flooding per- 
formed well at this sending frequency (see Figure [T(c)l i. 
However, when the network is not congested, then due 
to the higher connectivity and shorter average hop length, 
the average error decreases. For example, for RFiltercast 
and the biased protocol, when the sending frequency is 1 
packet/sec, then the maximum absolute error values with 
a transmission range of 100 meters are 0.4 and 0.6 re- 
spectively, as shown in Figure [T(c)| With the transmis- 
sion range changed to 150 meters, Figure |4(c)| shows that 
the maximum absolute error for RFiltercast and the biased 
protocol changes to 0.24 for both. 

Similarly, all the protocols have low error values at a 
sending frequency of 1 packet/ 2 sec when the transmis- 
sion range is 150 meters, as shown in Figure |4(d)| com- 
pared to the simulations with 100 meters transmission 
range, shown in Figure [T(d)l 

Up to this point our study considered static networks. 
In the next subsection we analyze protocols for non- 
uniform information dissemination in the presence of mo- 
bility along with a revised data model. 

4.2 Mobility Study 

To motivate the case for mobile sensors, consider a bat- 
tlefield scenario, where soldiers and armed vehicles are 
moving carrying tiny sensors along with them. Each sen- 
sor is collecting information about air contaminants so as 
to find out about potential biological/chemical attacks. In 
this case as a soldier moves around, the presence of an 
air contaminant sensed by the sensor changes depending 
upon the sensor's current location in the battlefield. Also, 
for a small change in location, there is not a very high 
change in the percentage of contaminant reported. We 
can think of this as a spatio-temporal process, where there 
is both spatial and temporal correlation among the read- 
ings reported by the sensors. This means that correlation 
among the sensor readings is a function of the distance 
between them; the closer the sensors are, the higher the 
correlation between their data. 
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Figure 3: Random Topology: Mean absolute error as a function of distance for different source data rates. 



In order to model this application, we divide the simu- 
lation area of 800x800 meters into 16 squares, each called 
a zone. We assume that the sensor readings (contaminant 
in this case) follow a normal distribution in space (inter- 
zonal distribution). For the inter-zonal distribution, we 
set the mean to 20 and the standard deviation to 2. This 
corresponds to a loose correlation among data sensed by 
all the sensors in the battlefield. Also, there will be very 
high correlation among data reported by sensors within 
the same zone. We modeled the correlation among sen- 
sors within a zone (intra-zonal) to follow a normal distri- 
bution but with low variance compared to that of the inter- 
zonal distribution. For the intra-zonal distribution we set 
the standard deviation to be a random number in the inter- 
val of to 0.5 (both inclusive). Also, we vary the mean of 
the intra-zonal distribution as the simulation progresses to 
reflect the temporal variations. During the initial half of 
the simulation, the mean slowly increases and then during 
the later half of the simulation it decreases gradually. For 
evaluating the performance of these protocols, we calcu- 
late the weighted error in the same way that we did for the 
static networks (e.g., Eqns.^andO. We want to empha- 
size that these zones are just an artifact of modeling the 
phenomenon (contaminant presence across the field). 

For static networks, distance between every pair of sen- 
sors is fixed (time invariant). However, in the case of mo- 
bile sensors, as the sensors move around, the distance be- 
tween a pair of sensors changes. By non-uniform informa- 
tion granularity, we intuitively mean that a sensor should 



have very precise information about its local neighbor- 
hood and loss in accuracy should be proportional to the 
distance between source and sink sensors. However, with 
mobile sensors, as the sensors move, the local neighbor- 
hood of a sensor changes as a function of time. It is thus 
interesting to study how the protocols (both deterministic 
and randomized) react to these neighborhood changes. 

We now analyze the performance of all the protocols 
when nodes are mobile with the revised data model. For 
mobility we consider the following two cases. In the first 
case we set the maximum speed of the sensors to 2 m/s 
(FigureO and in the second case to 10 m/s (Figure|6|l. The 
former model represents walking speeds (e.g., soldiers) 
while the later one represents vehicle speeds (e.g., tanks). 

The results presented here are the average of runs over 
3 random topologies. The error calculation is done based 
on the distance between two nodes at the time of read- 
ing. Note that for the static network simulations we previ- 
ously discussed, nodes chose a random number between 
and 100 as their initial value and then during the report- 
ing phase, each sensor incremented its reading by a fixed 
amount (10 each second) at fixed intervals. In the revised 
data model, the variations in the sensor readings are not so 
high. Thus, the results with the revised data model and the 
initial data model are not comparable numerically per se. 
However, one can clearly see the same trend in the relative 
performance of the different protocols. Figures ( |5(a)| and 
|5(b)| l and ( |6(a)||6(b)| and |6(c)| l show that RFiltercast and 
both the randomized protocols perform better than or very 
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(c) Data rate 1 packet/sec. 
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Figure 4: Tx= 150 m (Grid Topology): Mean absolute error as a function of distance for different source data rates. 
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Figure 5: Mobile sensors (speed 2 m/sec): Mean absolute error as a function of distance for different source data rates. 
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Figure 6: Mobile sensors (speed 10 m/sec): Mean absolute error as a function of distance for different source data 
rates. 
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close to that of flooding while the error value for Filtercast 
is high. Similar to its performance in static networks, for 
low data rates, flooding starts performing better in terms 
of accuracy than all the other protocols. 

Figure shows the trade-off between energy and 
weighted error, using the weighted error calculation 
method described in Eqns.^andlJlforboth mobility cases. 
In these figures, the X-axis indicates the energy spent in 
Joules and the Y-axis shows mean weighted error The 
trends observed are very similar to that of the static net- 
work (Figure|2ji. At high data rates, flooding spends maxi- 
mum energy and has the highest error Note that for higher 
mobility (Figure |7(b)t , the performance of flooding is 
worse than that of the low mobility case (Figure |7(a)^ . In 
the case of high mobility, the performance of RFiltercast 
and the biased randomized protocol is better than flooding 
except for the lowest data rate. These results indicate that 
the protocols are resilient to mobility, as changes in speed 
have almost no impact on the error values of the protocols 
for a realistic data model. 

5 Discussion 

Overall from these results, we can conclude the following: 
in the case of applications that can exploit non-uniform 
information, protocols can be designed to make efficient 
use of the available bandwidth while providing the nec- 
essary level of accuracy. Generally, RFiltercast outper- 
forms Filtercast when the network is not congested. Also, 
naive, randomized protocols such as the unbiased proto- 
col, outperform specialized protocols such as Filtercast. 
This is because in general these protocols forward mes- 
sages more aggressively with the parameters we selected. 
In our setting, since accuracy is a function of distance, the 
errors for far sensors count less and thus overall these pro- 
tocols perform well. The biased randomized protocol has 
comparable performance to that of RFiltercast. 

In the simulations presented here, the biased protocol 
performs better in terms of accuracy than the unbiased 
protocol, even for distant sink nodes. This effect is simply 
due to the forwarding probability settings and, most im- 
portantly, the size of the simulated networks, which lim- 
its the maximum number of hops between the source and 
destination. As mentioned previously, in the biased pro- 
tocol sensors transmit packets from nearby sensors with 
high probability, and this probability decreases linearly 
as a function of the number of hops between the source 
and the sensor transmitting the packet (see section U. 3. 2l l. 
However, in the case of the unbiased protocol, for any 
packet that needs to be forwarded, the forwarding prob- 
ability is constant (0.5 in our simulations). In our simu- 



lations, the maximum number of hops is limited to six, 
since we could not run larger simulations due to computa- 
tional resource constraints. Therefore, for the simulations 
presented here, the biased protocol has higher forward- 
ing probability than the unbiased protocol for the first few 
hops (with respect to the source), which dominates the 
picture. We conjecture that if we increase the number of 
hops (to, say, 20), then for distant sinks, the unbiased pro- 
tocol will perform better than the biased protocol in terms 
of accuracy. Note that both Filtercast and RFiltercast have 
some overhead to maintain the source lists and the count 
of how many packets a given source node has transmit- 
ted. On the other hand, the randomized protocols do not 
require such state to be maintained. Also, the random- 
ized protocols are resilient to mobility. We believe that 
randomized protocols with intelligent adjustments of for- 
warding probabilities can be considered as the most effi- 
cient alternative for non-uniform data dissemination. 

6 Related Work 



Recently, sensor networks have drawn a considerable 
amount of attention from the research community. Most 
of this existing work focuses on two primary cases: (1) 
Sensors send their data toward a central base station that 
has infinite power and is responsible for all data process- 
ing, and no in-network processing is done. (2) Sensors 
do some in-network data processing such as data fusion 
and this high level data is sent to the central base station. 
A number of such approaches have been proposed (e.g., 
mo ID)' However, in our case, we do not assume the 
presence of any such base station, and the sensors dis- 
seminate information among themselves so that the user 
can connect to any of the sensors to extract network infor- 
mation. 

Other studies considered specific sensor network appli- 
cations and their implication on protocol design. Cerpa et 
al. 1 2 1, have considered habitat monitoring and have de- 
signed protocols to match the appUcation need. Heinzel- 
man et al. |5|, described adaptive protocols for informa- 
tion dissemination. In this work, to save energy, sensors 
send out advertisements for data they have, and they only 
send the actual data if it is requested by one or more nodes. 
In previous work fl31, we described probabilistic flood- 
ing alternatives. However, the main goal of that work 
was congestion avoidance rather than non-uniform infor- 
mation dissemination. 

Li et al. 1 8 1 proposed a gossip-based approach for rout- 
ing protocols to reduce routing overhead. However, the 
study focused only on routing messages (with implicit 
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Figure 7: Mobile sensors: Weighted error-energy tradeoff. 



uniform information granularity requirement). Recently, 
Barette et al. 1 1 1 proposed a family of gossip-based rout- 
ing protocols for sensor networks. In their study they con- 
sidered various parameters such as the number of hops be- 
tween the source and the destination, the number of hops 
the packet has traveled, etc. 

In the DREAM 1 11 1 routing protocol, routing tables are 
updated based on the distance between two nodes and the 
mobility rate of a given node. While this work has a sim- 
ilar flavor to our work, exploiting non-uniform informa- 
tion needs, it is limited to only adjusting routing tables 
and does not apply to the actual data that is exchanged 
between two nodes. 

Kempe et al. 0| presented theoretical results for gossip- 
ing protocols with resource location as a motivating prob- 
lem and delay as the primary consideration. In their set- 
ting, a node at distance d from the origin of a new informa- 
tion source should learn about it with a delay that grows 
slowly with d and independent of network size. They do 
not consider application level performance criteria such as 
accuracy, which is part of our study. 

In this paper we have considered flooding as one of 
the alternatives for data dissemination in sensor networks. 
However, flooding and its alternatives have also been ex- 
plored in the context of mobile ad hoc networks. Perkins 
et al. describe IP Flooding in ad-hoc networks (6| . While 
this paper considers probabilistic flooding protocols for 
sensor networks, Sasson et al. have studied probabilis- 
tic flooding for ad hoc networks 1171 and used the phase 
transition phenomenon as a basis to select the broad- 



casting probability. Williams et al. |16| described and 
compared several broadcasting protocols (including prob- 
abilistic protocols) in the context of mobile ad hoc net- 
works. 

The primary difference between our work and existing 
work is the application requirement. In our study, we fo- 
cus on a new application requirement, non-uniform infor- 
mation dissemination, and we analyze protocols for this 
class of applications. 



7 Conclusions and Future Work 

In this paper we considered sensor network applications 
where events need to be disseminated to observers that 
may be present anywhere in the sensor field. For such ap- 
plications, simply flooding all the data is extremely waste- 
ful. Therefore, we defined the idea of non-uniform in- 
formation dissemination to capitalize on the fact that the 
value of data is typically highest for observers that are 
closest to the source of the data. We developed and an- 
alyzed several protocols to accomplish non-uniform dis- 
semination, both deterministic (Filtercast and RFiltercast) 
and non-deterministic (unbiased and biased) protocols, 
and we evaluated them under various traffic loads and 
transmission ranges and with or without mobility. In all 
cases, the developed protocols were clearly superior to 
simple flooding, both from an application and a network 
perspective. With flooding, congestion appears to be a 
limiting constraint and further, flooding is not generally 
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energy-efficient. Our results indicate that the performance 
of RFiltercast and the biased randomized protocol is al- 
most equivalent. RFiltercast requires each sensor to main- 
tain some extra state information, whereas the biased ran- 
domized protocol is completely stateless and has negU- 
gible overhead. Also, we note that the performance of 
Filtercast and the unbiased randomized protocol is almost 
equivalent. We also showed that RFiltercast as well as the 
randomized protocols are resilient to mobility. 

While in this paper the distance between two nodes is 
used as a parameter for non-uniform data dissemination, 
in our future work we will focus on a broad range of ap- 
plications with a non-uniform information dissemination 
requirement, where factors other than distance, such as 
importance of the information and confidence in the gen- 
erated data, can be used. Also, we would like to develop 
protocols that will tune the forwarding probabiUties dy- 
namically depending upon factors such as traffic load, net- 
work connectivity, resources (remaining battery power), 
etc. We believe that this will be an important step towards 
making these networks self-configuring. 

We would also like to develop a priority -based protocol 
where a source marks all its outgoing packets with a cer- 
tain priority to indicate the importance of the information 
contained in the given packet. Any forwarding node can 
consider the priority of the packet when making its for- 
warding decision. These techniques will extend the appli- 
cability of non-uniform information dissemination to new 
classes of applications for wireless sensor networks. 
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